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Abstract 


For  certain  aspects  of  the  calculus  of  variations  and  optimal 
control  theory,  the  paper  of  Kurt  Friedrichs  listed  in  the  title  of  this 
report  has  had  a strong  influence,  both  directly  and  indirectly.  The 
present  paper  is  devoted  to  a brief  historical  survey  of  this  influence, 
especially  in  the  area  of  convex  analysis.  In  addition,  a translation  of 
Friedrichs'  paper  is  appended  to  this  report. 


1.  Introduction.  Recently  Professor  Thames  called  my  attention  to  Hi*- 
paper  of  Friedrichs  referenced  f 1 5 1 , of  which  I was  not  aware,  although 
I was  cognizant  of  various  more  recent  papers  whose  lineage  may  be  traced 
buck  directly  or  indirectly  to  this  paper.  Following  are  some  comments  on 
the  influence  of  this  paper  on  the  calculus  of  variations  and  control  theory. 

No  claim  is  made  as  to  the  completeness  of  the  presented  list  of  references. 
Moreover,  the  author  does  not  pretend  to  be  conversant  with  the  details  of 
all  the  papers  listed  in  this  survey.  Also  appended  to  this  report  is  a 
translation  of  Friedrichs'  paper. 

In  the  variational  study  of  extrema  wide  use  is  made  of  the  fact  that 
there  is  no  qualitative  distinction  between  minima  and  maxima.  Indeed,  if 
•I0  X ■*  R is  a real-valued  function  on  an  abstract  space  X which  has  a 
minimum  m on  X at  a value  xn,  (i.e.,  m r J0(xQ)  s .J0(x)  ^or  x e 
then  J = -J0  is  a real-valued  function  on  X which  has  a maximum  -m  on  X 
at  x0,  (i.e.,  -m  J(xQ)  * J(x)  for  all  x £X).  In  abstract  context,  the  basic 
contribution  of  Friedrichs  [15]  was  to  associate  with  a particular  integral 
variational  functional  J0:  X -♦  R a related,  (variously  called  "reciprocal", 
"dual",  "conjugate")  variational  functional  Jj:Xj  *♦  R , where  Xj  is  a second 
space,  in  general  different  from  X,  such  that  if  JQ  possesses  a minimum  m 
on  X at  a value  xQ  then  Jj  possesses  at  a value  xj  tX]  a maximum  value 
on  Xj  which  is  also  equal  to  m.  In  this  situation,  consideration  of  extre- 
mizing  sequences  for  J()  and  J j yields  both  upper  and  lower  liounds  for  the 
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I'xtri'miim  value  m.  In  the  introduction  to  [15]  Friedrichs  states  that  his 
attention  to  the  problem  was  directed  by  the  paper  |39]  of  Trefftz,  wherein 
there  is  associated  with  the  Ritz  method  a companion  method  to  obtain  both 
upper  and  lower  bounds,  with  particular  application  to  a problem  in  torsional 


rigidity. 
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2.  Relatively  early  influences  of  Friedrichs 1 pap«  r.  Soon  after 
1h('  appearance  of  Friedrichs'  paper  U5]  it  was  applied  to  certain  problems 
in  elasticity  theory  by  Hasu  f5,6|.  The  results  of  [ 15]  were  also  expounded 
in  the  second  edition  of  Courant-Hilbert  [hi,  §9  of  Ch.  IV,  pp.  109-2091, 
which  appeared  in  1931;  indeed,  this  discussion  discriminates  between  the 
non-singularity  aspects  of  Friedrichs'  transformation  that  pertain  to  the 
stationary  character  of  the  involved  functionals,  and  other  (strong  convexity) 
aspects  that  provide  the  extremizing  properties  of  the  funetionals. 

In  his  1942  address  (9;  Sec.  II.  4]  on  variational  procedures  for  the 
solution  of  problems  of  equilibrium  and  vibration,  Courant  again  emphasized 
the  mathematical  concept  of  representing  the  minimum  value  of  a given 
variational  problem  as  a maximum  value  of  an  associated  variational 
problem.  References  to  Friedrichs'  method  also  appeared  in  books  on 
elasticity  theory,  notably  in  the  first  edition  of  Sokolnikoff  [36.  Sec.  711. 

Also  in  the  years  1947-49  there  appeared  papers  [37],  [10],  [11],  [12],  [18] 
presenting  work  of  Prager,  Synge,  Diaz,  Weinstein  and  CIreenberg  on 
variational  problems  related  to  that  of  Friedrichs  [15],  but  in  general 
employing  alternate  methods  of  treatment. 

In  the  1953  English  edition  of  Courant-Hilbert  the  method  of  Friedrichs 
is  again  elaborated  upon  [8ii;  §9  of  Ch.  IV,  pp.  231-242].  Moreover,  there 
is  added  a new  section  on  reciprocal  quadratic  variational  problems 
[ 8 i i ; §11  of  Ch.  IV,  pp.  252-257],  presenting  an  abstract  analysis  of  such 
quadratic  problems.  In  particular,  attention  is  directed  to  work  of  Synge 
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on  a geometric  interpretation  from  which  one  may  deduce  an  estimate  ot  tin 
distance  of  approximations  from  exact  solutions.  For  a discussion  of  Svnge  s 
method  of  the  hypercircle  see  [38],  and  references  presented  therein.  In  the 
general  area  of  bilateral  estimates  of  extremum  values  one  should  also 
include  the  ' intermediate  problem"  work  of  Weinstein,  dating  from  the  late 
inSO's.  In  this  procedure  the  deter mination  of  upper  bounds  of  eigenvalues, 
which  in  many  cases  may  be  computed  by  the  Rayleigh- Ritz  method,  is 
supplemented  by  the  consideration  of  intermediate  problems  with  weakened 
prescribed  conditions,  leading  to  lower  bounds  for  the  eigenvalues,  (see 
Gould  r 17],  and  Weinstein  and  Stenger  [40]). 

In  his  study  of  extremal  problems  for  bounded  analytic  functions 
in  a multiply  connected  domain.  Lax  [23]  utilized  certain  conjugate  variational 
problems  which  in  the  abstract  resulted  from  the  fact  that  by  an  application 
of  the  Hahn-Ranach  theorem  it  follows  that  if  S is  a linear  manifold  in  a 
normed  linear  space  X and  x e X,  then  the  minimum  of  1 1 y-x  | :y c s]  is  equal 
to  the  maximum  of  |i(x)|  for  all  continuous  linear  functionals  £ of  norm  one 
which  vanish  on  S.  At  the  end  of  the  introduction  to  123]  Lax  points  out  that 
this  abstract  principle  includes  the  results  of  the  above  cited  §11  added  in 
the  English  edition  of  Courant-Hilbert,  and  proceeds  with  the  comment: 

"On  the  other  hand,  the  abstract  principle  itself  can  be  regarded  as  a special 
case  of  pairs  of  conjugate  extremal  problems  described  by  Friedrichs.  This 
is  especially  clear  from  an  abstract  version  of  the  Friedrichs  principle  that 
I have  recently  found,  and  which  I hope  to  describe  at  another  occasion. 


Recently  the  author  has  inquired  of  Lax  as  to  his  abstract  version  of  the 
Friedrichs  principle,  but  to  date  has  received  no  reply. 

In  the  1962  book  of  Funk  (16]  there  is  a rather  lengthy  discussion, 
(Ch.  VIII,  pp.  498-531),  of  the  principle  of  Friedrichs  and  its  applications 


in  elasticity  theory. 
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I Ik*  central  role  of  convexity.  In  order  to  facilitate  later  discussion, 

at  this  stage  we  formulate  in  classical  terms  a variational  problem,  which  for 

n=l  reduces  to  the  first  problem  considered  by  Friedrichs  | 1 5] . Consider- 

the  tixed  end-point  problem  of  minimizing  the  integral  functional 

(3.  1)  .1  [y]  f 1 f(x,y(x),  y'(x)  )dx 

J xo 

in  the  class  of  n-dimensional  vector  functions  y(x)  - yjx)),  (c*  = l,...tn), 
which  are  continuous  and  piecewise  continuously  differentiable  on  [xq.xjI, 
with  elements  (x,  y(x),  y ' (x)  ),  xefxo.xj],  in  a given  region  R in  (x,y,r) 

(x,  y i»  . . . , yn,  r j,  . . . , rn)  space,  and  satisfying  the  end -conditions 
(3.2)  y(xG)  = y°,  y(xj)  = y1  . 

rt  is  assumed  that  for  xn  * x « xj  each  of  the  sections  Rx  = t(y,  r):(x,y,  rle  R] 
is  a convex  subset  of  (y,  r)-space.  Moreover,  it  is  supposed  that  on  R the* 
real-valued  integrand  function  f(x,  y,  r)  =•  f(x,  yi,  ....  yn,  r j,  ....  rn)  is 
continuous  and  has  continuous  partial  derivatives  of  the  first  two  orders 


with  respect  to  the  yQ,  ra , (or  = 1,  ....  n),  while  for  (x,  y,  r)e  R the  2n  x 2n  real 
symmetric  matrix 


(a;  3 = 1,  ....  n) 


(3.  3) 

frr 

fry 

__ 

fr«rP 

fr*ys 

fyr 

fyy 

ra 

fvye 

is  positive  definite. 

Under  these  hypotheses  the  vector  equations 

ffr^x.y,  r)  = za, 


fy^x.y.  r)  = 


w 


a 


(3.  4) 


fr  (x,  y,  r)  = z, 
fy  (x,  y,  r)  = w. 


(o  = 1,  ....  n) 


i 


\ 


* 1 « • fi n« ■ ; i 1-1  continuously  differentiable  mapping  of  K onto  a region  A of 
(x , 7 , w ) - space . If 


<3.  5) 


r K(x,z,w) 
y ; Y z,  w). 


Rq,  (x,  z,  w), 

Y ( x , z , w ) , 
a ' 


denote  the  solution  of  (3.5)  for  (x,z,w)  A,  and  $(x,  z,w)  is  defined  by  the 


Legendre  transform 


y=Y(x,z,w',  r R(x,z,w) 

(3.6)  (x,  z,  w)  z r-t  w y-f(x,  y,  r)  | 

(where  in  general  v is  used  to  denote  the  transpose  of  a vector  V),  then  the 
reciprocal  functional  .1  ^ [ z ] of  Friedrichs  [15]  is  the  negative  of  the  functional 

(3.7)  J 2! 7 1 = J x * ®(x,z(x),zl(x))dx  - [z  (xj)y1  - z "(x  )y°|  . 

Jxo 


Consequently,  the  statement  that  on  the  class  of  continuous  and  piecewise 
continuously  differentiable  z.(x),  xe[x0,Xjl  , with  elements  (x,  z(x),  z'(x)  ) r A 
the  maximum  of  .Ij[z]  is  equal  to  the  minimum  of  . 1 0 ( y ] on  the  above  described 
class  of  y(x)  satisfying  (3.2)  is  equivalent  to  the  statement  that  on  the 
respective  classes  the  minimum  of  J2[z]  is  equal  to  the  negative  of  the 
minimum  of  JQ[y]. 

In  the  particular  case  wherein  f involves  y only  linearly,  so  that 
we  may  write 

(3.8)  f(x,y,r)  = g(x,  r)  + k (x)y, 

the  positive  definiteness  of  (3.3)  is  replaced  by  the  positive  definiteness  of 
the  n x n real  symmetric  matrix 
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(8.!>)  frr(x,y,r)  = grr<x,r>  [ r)]  , (a  , 3 1, n). 

Correspondingly,  (.1.4)  is  replaced  by 
(8.  10)  g ,.( x , r)  y. 

with  solution  r R(x,z),  and 

• i r ( x / ) 

(3.11)  $ ( x , 7. ) z ' r - g ( x , r ) | 

Finally,  in  Section  III  of  [15]  Friedrichs  discusses  a quadratic  double 
integral  problem,  for  which  reference  is  made  to  the  appended  translation. 

The  point  of  principal  interest  is  that  under  the  above  described 
differentiability  conditions  the  positive  definiteness  of  the  2 n x 2n  matrix 
(3.3)  throughout  R is  equivalent  to  the  condition  that  f is  strongly  convex 
in  (v,  r)  on  each  section  Rx.  Correspondingly,  the  positive  definiteness  of 
(3.9)  is  equivalent  to  the  strong  convexity  of  f in  r on  each  R - {r:(x,  y,  r)e  Rl 

Now  conditions  involving  the  positive  definiteness  of  the  matrix 
frr^.y.H  are  of  long-standing  in  the  calculus  of  variations.  For  a 
variational  problem  involving  (3.  1)  the  satisfaction  of  this  condition  along 
a given  extremal  arc  is  known  as  the  strengthened,  (or  sufficient  condition 
form  of  the)  Legendre  condition,  and  appears  in  standard  sufficiency  theorems 
of  the  classical  calculus  of  variations.  The  assumption  that  the  matrix  •' 

frr(x,y,r)  be  positive  definite  throughout  the  entire  region  R of  admissible 
elements  has  frequently  been  called  a condition  of  regularity.  In  particular, 
this  condition  implies  a definiteness  of  the  Weierstrass  E -function,  and  the 
lower  semi -continuity  of  the  integral  functional  .lG[y|  in  certain  senses.  For 
a discussion  of  the  significance  of  the  convexity  of  f in  r,  and  indeed  the 


JE- 


recasting  of  the  entire  Hamiltonian  conc  epts  in  terms  of  convexity,  the 

reader  is  referred  to  Young  (41;  Chs.  i\  and  \ in  particular].  In  this 

connection  it  is  to  be  remarked  that  only  quite  recently  has  Cesari  [7|  proved 

that  if  f is  continuous  in  (x,  y,  r)  for  (x,y)  in  a given  closed  subset  A of 

(x.y)-space  and  r arbitrary,  whiled  denotes  the  class  of  absolutely 

continuous  vector  functions  y(x)  = (y^fx)),  xe[xD,  Xj],  with  graph  in  A and 

f(x,y(x),  y'(x))  Lebesgue  integrable,  (finite  or  infinite),  on  [x0,xj],  and  (ni) 

denotes  the  mode  of  convergence  of  sequences  ty*k)(t)}  of  elements  in  T 

(k)'  i 

in  the  sense  of  weak  convergence  of  t.y  (t ) j in  the  Lebesgue  space 
L[x0,  Xj]  and  uniform  convergence  of  [,y^k'(t)3  on  [x0,  xj],  then  a necessary 
and  sufficient  condition  for  the  lower  semi-continuity  of  .ID[y]  with  respect  to 
mode  (m)  at  each  yeT  is  that  for  each  (x,y)eA  the  function  f is  convex  in  r. 

Modern  appreciation  of  the  central  role  of  convexity  in  the  study  of 
extremum  problems  received  initial  impetus  from  the  concept  of  conjugate 
convex  functions  introduced  by  Fenchel  [14],  and  the  fact  that  Fenchel's 
conjugate  correspondence  for  convex  functions  may  be  viewed  as  a generalization 
of  the  classical  Legendre  transformation.  For  a detailed  discussion  of  this 
concept  and  the  fact  that  it  enables  one  to  treat  variational  problems  devoid 
of  differentiability  assumptions,  see  Rockafellar  [31;  Secs.  12  and  16,  in 
particular].  Attention  will  be  limited  here  to  listing  papers  dealing  with 
"dual"  or  "reciprocal"  problems  under  conditions  corresponding  somewhat 
to  the  strong  convexity  assumption  of  Friedrichs  expressed  by  the  positive 
definiteness  of  the  matrix  (3.  3)  in  the  problem  specifically  formulated  above. 
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In  this  connection  there  are  the  papers  of  Hanson  [21],  Pearson  [26,27], 
Kreindler  [22],  Monrl  and  Hanson  [25],  However,  the  work  of  Roekafellar 
as  presented  in  [32,33,34,35]  appears  to  be  more  intimately  related  to  the 
work  of  Friedrichs,  in  particular,  in  these  papers  Roekafellar  is  concerned 
with  generalized  variational  problems  of  Bolza  form,  wherein  various 
restraints  are  incorporated  through  the  use  of  +<*>as  a possible  functional 
value  for  the  involved  "integrand  functions"  and  "boundary  functionals". 

Hager  and  Mitter  [20]  have  recently  considered  dual  problems  that  are  somewhat 
different  from  those  treated  by  Roekafellar  [34],  and  wherein  the  constraints 
are  given  explicitly  by  inequalities.  Also,  for  control  problems  of  Bolza  in 
Hilbert  spaces  Barbu  [2,3]  has  used  methods  of  convex  analysis  and  of 
maximal  monotone  operators  to  obtain  results  corresponding  to  some  of  those 
of  Roekafellar  in  the  papers  cited  above. 

For  a class  of  Bolza  type  problems  involving  functions  which  satisfy 
differentiability  conditions  of  a more  classical  nature  and  a type  of  "semi-local 
convexity",  sufficient  conditions  for  a global  extremum  have  been  given 
recently  by  Ewing  [131.  A somewhat  related  sufficiency  proof  of  ar.  absolute 
minimum  for  a non-parametric  variational  problem  has  been  presented  by 


Reid  [30], 


4.  Other  remarks.  This  section  is  devoted  to  comments  on  two  types 


of  problems  concerned  with  the  Euler  equations  for  variational  functionals 
that  are  reciprocal  in  the  sense  of  Friedrichs  [15], 

Firstly,  in  the  study  of  oscillation  phenomena  for  ordinary  differential 
equations  there  has  appeared  the  concept  of  reciprocal  equations.  Specifically 
for  p(x)  and  q(x)  continuous  real-valued  functions  which  are  non-zero  on  a 
given  interval  I on  the  real  line,  with  the  differential  equation 
(4.  1)  (p(x)y'(x)]'  - q(x)y(x)  = 0 

there  has  been  associated  the  "reciprocal  equation" 

l4-2’  i^z',x)|'  * ifei = “• 

(see  Potter  (28],  and  Parrett  f 4 ] . If 

(4.3)  f(x,y,r)  = | fp(x)r2  + q(x)y2]  , 

then  (4.  1)  is  the  Euler  equation  for  the  variational  integral  (3.  1),  and 

(4.  2)  is  the  Euler  equation  for  the  associated  integral  of  (3.  7),  wherein 

1 1 2,12, 

*<x,  z,  w)  = g l q(x)  w pTx)  7 I • 

For  integral  functionals  (3.  1)  with  integrand  a quadratic  form 

(4.4)  f(x,y,  r)  = r [R(x)r  + Q(x)y]  + y [Q  (x)r+P(x)y] 

in  (y,  r)  = (y  j ,...,  yn>  r j,  ....  r ) and  R(x)  nonsingular,  with  the  canonical 
form  of  the  Euler  equation  in  vector  functions  (u;v)  = (uj,  . . . , un;v  j,  . . . , vn). 
Reid  [29]  associated  a related  problem,  that  he  called  the  "obverse  problem'  , 
and  which  had  the  property  that  (u;v)  was  a solution  of  the  canonical  system 
for  the  original  problem  if  and  only  if  (v;u)  was  a solution  of  the  obverse 


f 


problem.  If  for  the  variational  function  <M.  li  with  i n t * grand  <4  4 x< 
addition  to  the  non-singularity  of  1H\>  th  1 1 tin  2n  x 2n  ni  itrix  f un<  turn 


[g  (x)  l'(x>j 

is  non  - singular  for  all  x then  the  obverse*  system  of  [ 2 f ] is  indeed  tin 
canonical  form  of  the  Euler  equations  for  t ho  corresponding  reciprocal 
problem  of  Friedrichs.  An  alternate  equivalent  form  of  associated  problem, 
wherein  (u;v)  is  a solution  of  the  primal  differential  system  if  and  only  if 
( - v : u > is  a solution  of  the  associated  system,  has  been  introduced  by 
Ahlbrandt  f 1 ] . 

Finally,  it  is  to  be  noted  that  for  a double  integral  non-parametric 


variational  integral 


1-5)  JM  = jjc  F(ZX,  Zy 


) dxdy. 


Haar  [19]  introduced  the  concept  of  an  associated  "adjoint'  problem,  wherein 
an  extremal  surface  for  (4.5)  determines  a cor  responding  "adjoint  extremal 
surface"  of  the  adjoint  problem.  In  particular,  Haar's  basic  assumptions 
on  the  integrand  function  F(p,  q)  involved  the  non-zero  character  of 


FppFqq  - F^  and  F - pFp  - qFq  for  all  admissible  p,  q.  Subsequently, 
Mickle  [24]  showed  that  under  suitable  non-singularity  conditions  the  adjoint 


problem  of  Haar  was  but  one  of  a set  of  twenty-four  problems  associated 
with  (4.5)  through  like  transformations,  and  that  the  twenty-four  transforma- 
tions possessed  certain  group  properties.  As  variational  problems  of  this 
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sort  arc  in  nature-  somewhat  like  the  double  integral  problem  eonsidered  in  the 
last  section  of  [15],  there  arises  the  question  as  to  whether  or  not  there 
exist  pertinent  relationships  between  the  respective  reciprocal  problems 
of  Friedrichs  for  a given  problem  (4.5)  and  an  associated  problem  of  Mickle. 
As  far  as  the  author  is  aware,  such  questions  have  not  been  considered. 
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5.  Translation  of  Friedrichs ' paper. 

\ MKT  He'll ) OF  THK  OAI.C’l  1.1  S OF  VARIATIONS, 

WlIKKKl^  THK  MINIMUM  OF  AN  INTKORAI. 

IS  I’RKSKNTFD  AS  THK  MAXIMUM  OF  ANOTHKH  KXFRKSSION 

Hy  Kurt  Friedrichs,  Aachen 

Presented  by  R.  Courant  in  the  session  of  December  7,  1028. 

In  the  numerical  solution  of  variational  problems  by  the  Ritz  method 
it  is  important  to  estimate  the  accuracy  of  approximation11.  K.  Trefftz  has 
given  a method  --  indeed,  for  the  Dirichlet  problem  and  related  topics  -- 
whereby  the  solution  of  the  calculus  of  variations  problem  is  approxi  ma  ted 
in  such  a manner  that  the  minimum  value  is  approached  from  below. 
Consequently,  through  the  combined  application  of  this  method  and  that  of 
Ritz  the  minimum  value  is  bounded  from  both  sides.  In  the  following  the 
same  goal  is  achieved,  and  in  a manner  more  general  than  that  of  Trefftz. 
Very  generally  one  may  associate  with  a given  minimum  problem  a maximum 
problem,  whose  maximum  value  is  equal  to  the  minimum  value  of  the  original 
problem.  The  underlying  principle  is  essentially  a Legendre  transformation. 
For  example,  to  the  variational  problem  for  the  solution  of  the  potential 
equation  with  prescribed  boundary  conditions  there  is  associated  a problem 
for  the  conjugate  potential  function.  Also,  with  the  aid  of  a Legendre 
transformation,  in  elasticity  theory  one  associates  with  the  principle  of 
"virtual  displacement"  the  so-called  Castigliano  principle  of  "minimum  work 


of  deformation". 
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In  order  to  proceed  with  assurance  that  the  new  formally  constructed 
variational  problem  does  admit  a maximum,  there  must  he  imposed  certain 
strong  conditions  of  definiteness,  which  moreover  frequently  remain 
satisfied  by  modifications  of  the  variational  expression. 

For  numerical  consideration  the  following  fact  is  also  of  importance: 
if  for  the  initial  problem  the  boundary  values  of  the  function  are  prescribed, 
such  is  not  the  case  for  the  associated  problem:  for  it,  the  boundary 
conditions  appear  as  ’natural"  ones . The  first  case  has  the  disadvantage 
that  one  is  restricted  in  the  choice  of  approximation  functions;  in  the  second 
case,  empirically  the  convergence  is  very  slow.  However,  one  now  has  the 
possibility  of  choosing  which  handicap  he  will  accept. 


in 


I . Variational  problems  in  one  variable. 

Firstly,  we  consider  the  problem  of  minimizing  the  integral 

(1)  (Xl  F(u\  u,  xldx, 

^x0 

with  the  functions  u(x>  satisfying  prescribed  boundary  conditions.  Moreover, 
it  is  to  be  remarked  that  the  following  discussion  can  be  carried  out  when 
boundary  conditions  of  another  type  are  present,  but  we  do  not  discuss  this 
aspect  separately. 

41 


The  fact  that  u'(x)  is  the  derivative  of  u(x>  is  formulated  as  an 


individual  equation 
du 


(2) 


dx 


u 


The  boundary  conditions  are  expressed  as: 

(3)  u ( Xo  1 = uc  , u(xj)  = Uj  , or  simply,  u - IT  for  x-xQ  and  x = xj  , 
w-here  u0,  u^  are  arbitrary  given  values. 

Finally,  let 

(4) 

(5) 


F i > 0 , 

u u 


[This  should  read  F’ u , u , > 0] 


A = F i , F - F , > 0 

u u uu  uu 


We  suppose  that  the  minimum  of  (1)  is  provided  by  a function  u(x) 
which  has  continuous  derivatives  of  the  first  two  orders,  and  has  elements 
(u’(x),  u(x),  x)  interior  to  B.  It  satisfies  the  Euler  equation 
d 


(6) 


dx  Fu'  = Fu 


We  now  introduce  new  functions  p(x),  p'(x)  through  the  equations 
F„'  = P • Fu  = P'  • 


(7) 
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by  which,  in  view  of  (5),  u and  u'  arc  determined  as  functions  of  p and  p'. 
With  these  functions  we  construct  a new  integrand 
(8>  ♦(p’.p,  x)  pu’  + p'u  - F(u',u,x), 

for  which 

(9)  4>pi  u,  4>p  u'  . 

As  a new  variational  problem  we  now  consider 

C x _ xi  5) 

( 10 ) 1 ♦dx  - pu  = Min. 

J x0  xo 

B ) 

on  the  class  of  all  continuous  functions  p(x)  and  p'(x)  , not  restricted  bv 

boundary  conditions,  and  subject  only  to  the  subsidiary  equation 


We  maintain  that  since  u(x)  and  u'(x)  are  elements  of  a solution  of  the 
minimum  problem  involving  (1)  the  corresponding  functions  p(x',  p'(x)  of 
(7)  are  elements  of  a minimum  for  the  expression  (10).  First  of  all,  in 
view  of  (6)  equation  (11)  is  satisfied.  The  new  Euler  equation 


is,  in  view  of  (9),  none  other  than  the  subsidiary  condition  (2),  and  is  also 
satisfied.  The  natural  boundary  condition 
(13)  ♦pi  - u = 0 for  x=xQ  and  x = Xj  , 

is  none  other  than  the  boundary  condition  (3).  Consequently  the  expression 
(10)  is  stationary.  That  it  is  indeed  a minimum  follows  from  the  positiveness 

of  the  second  variation  of  (10)  for  all  (p'(x),  p(x),  x)  in  the  region  P . In  view 
of  (4),  (5),  from  (9),  (7)  one  obtains 


4 , 


2 1 


(14  ) 


* , 


p'p'  A f u'u' 


> 0 


(15) 


, , $ - <f“  . 

P P PP  PP 


> 0 


The  minimum  value  for  the  new  problem  (10)  is 

,X|  x | ,x. 

( (pu'tp’u-F)dx-pu  | r - I Fdx, 
J x0  xo  J xo 


which  is  the  negative  of  the  minimum  value  for  problem  (1). 


■d 


r I . App<‘n<lic“S 

1.  If  the  strong  definiteness  condition  (5)  is  not  satisfied,  but  the 

greatest  values  of  -F  and  |l<’uuil  are  sufficiently  small  in  relation  to  the 

7) 

smallest  value  of  K , i , the  condition  (5)  can  be  attained  by  the  addition 


of  an  integral 


G(u,  x)dx 


with  G appropriately  chosen  --  perhaps  of  the  form  G 3 (a  x + 3 ) u “ . 


2.  Without  knowledge  of  the  existence  of  a solution  of  (1),  it  follows 

that  the  sum  of  the  greatest  lower  bounds  of  (1)  and  (10)  is  non-negative, 

B ) 

Indeed,  for  arbitrary  independent  admissible  functions  u(x),  p(x)  with 
u'  r du/dx  , p'  = dp/dx  , 


( 1 [F(u 


) + $ (p)]dx  - pu  | x ' 


fv  1 [F(u)  - F(v)  + vF  ( v ) + v'F  i ( v ) - uF  (v)  - u'F  ,(v)]dx, 

/ X-.  14  14  14  14 


wherein  we  set  v = $pt  , v'  = $p  and  p = Fut(v)  , p'  = Fu(v).  The  integrand 
is  none  other  than  the  second  order  remainder  in  the  expansion  of  F(u)  with 
respect  to  u',  u in  a neighborhood  of  v\  v,  which  in  view  of  (4),  (5)  is  non- 
negative and  equal  to  zero  if  and  only  if  u = v,  u'  = v ' ; that,  is,  if  and  only  if 
u(x)  and  p(x)  are  solutions  of  the  respective  problems. 

3.  We  consider  also  the  degenerate  case  F’uu  = F , = 0,  wherein 
we  may  set 


F = G(u\  x)  + k(x)u. 


4 


23 

Here  we  require  only  the  condition  (4);  that  is,  G t i > 0 . The  Legendre 
transformation  now  becomes  simply 

(17)  Ku>  r p,  $(p,  x)  pu'  - C ; , $p  u', 

where  we  may  express  u'  in  terms  of  p and  conversely.  The  function  p(x) 
thus  arising  from  a solution  u(x)  of  (1)  renders  (10)  a minimum  in  the  class 
of  all  functions  p(x)  satisfying  the  subsidiary  condition 

(18)  = k . 
dx 

(p1  does  not  appear  in  the  expression  (10)  ).  Indeed,  (18)  permits  only 
variations  of  the  form  p+constant,  and  the  first  variation  of  (10)  is 

v*  - “ l*1  ■ 0 ■ 

J xo  x° 

in  view  of  $>p  = u',  while  the  second  variation  is  positive  in  view  of  (4). 

4.  When  we  distinguish  for  a variational  problem  the  constraint 
conditions,  u'  is  the  derivative  of  u,  u assumes  given  boundary  values,  and 
the  variational  conditions  : Euler  equation,  natural  boundary  (transversality) 
conditions,  the  preceding  discussion  yields  the  result  that  the  constraint 
conditions  of  the  original  problem  are  the  variational  conditions  of  the 
transformed  problem,  and  conversely. 

5.  Method  of  Lagrange  multipliers.  The  preceding  transformation 
of  variational  problems  may  be  generalized  when  one  employs  the  method 
of  Lagrange  multipliers.  The  minimum  problem  (1)  with  the  restraints  (2), 

(3)  is  replaced  by  an  unrestrained  variational  problem:  render  stationary 
the  expression 

1 = |xq  ( F + p<x>  (d*  ' u’)  1 dx  ■ q(u-u ) I**  . 


(ltt) 


24 


where  the  mutually  independent  functions  u(x),  u'(x),  p(x)  and  the  values 
q cjj  are  to  be  varied.  The  first  variation  of  I is 

(20)  51  = /x^l  (Fu'^)6u  4 {h'u'  ' P)6u'  4 (fT7  " u')6p]dx  - [(q-p)6u  Mu-u)&q  |x^  . 

Through  variation  of  p and  q the  equations  (2)  ^ - u'  0, 

(3)  u - u = 0 at  x = xQ>  x xj  are  obtained,  and  through  variation  of  u 
and  u', 

dp 

(2D  Fu  - d5  = 0, 

(22)  Fu,  - p = 0 , 

(23)  q - p 0,  r qo  - p(xQ)  r 0 , q j - p(xj)  - 0]. 

The  original  problem  arises  when  one  initially  imposes  (2)  and  (3)  as 
restraints.  Then  (21),  (22)  and  (23)  are  variational  conditions.  On  the 

other  hand,  if  one  places  these  last  three  c onditions  as  restraints  then  (2) 
and  (3)  appear  as  variational  conditions.  This  is  the  transformed  maximum 

problem.  Moreover,  one  makes  use  of  the  fact  that  this  free  variation  of 
p( x ) is  not  restricted  by  the  conditions  (21),  (22);  however,  this  is  no  longer 
the  case  as  soon  as  u and  u'  are  expressed  in  terms  of  p and  dp/dx  in 
accord  with  (21),  (22),  as  was  supposed  earlier. 

As  soon  as  p(x)  is  introduced  as  a single'  new  function,  the  substitution 
of  (21),  (22),  (23)  in  (19),  and  integration  by  parts,  yields  the  expression 

jx;  [f  • uKu  • u’FU'idx  + Fu,ff  ix„  * 

which  is  the  negative  of  (10). 


4 


Incidentally,  it  is  to  be  noted  that  whenever  only  the  equation  (22) 
and  the  boundary  conditions  (3)  are  prescribed  in  problem  (19)  one  is  lead 
to  the  "canonical  equations"  (2),  (21)  of  the  variational  problem. 

I’o  choose  initially  the  Lagrange  multiplier  rule  has  the  advantage 
that  one  sees  readily  how  to  proceed  if  further  restraining  conditions  of  an 
arbitrary  nature  are  present. 

Added  in  proof.  As  Mr.  Courant  has  remarked,  the  passage  from  a 
minimum  problem  to  a maximum  problem  has  a purely  conceptual  basis. 
This  will  be  discussed  elsewhere. 
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III.  Examples  of  variational  problems  in  two  variables. 
1.  We  consider  first  the  problem 


(24) 


iff  (u2  + u2 
2J)C,  x v 


+ air  Irlxdy  - \1in.,  u-  u on  F,  a > 0 , 


.9) 


where  G is  a region  in  the'  (x,y) -plane  with  boundary  P , and  u(x.  y) 

10) 

is  a solution  of  this  problem 

We  introduce  new  functions  p,  q,  d through  the  equations 
(2F>)  p = ux  , q 

and  consider  the  new  integrand 


uy  , d = au, 


' 7(uv  4 u2  + au2)  = d2  + p“  + q2  )• 


PUX  quy  + du  - j(„x 
Accordingly,  we  produee  the  new  variational  problem 
(26)  ML(^2  + P2+q2)dxdy  - J (px  + qyn)  u ds  = Min. 

C { j 


(xn,  yn  are  components  of  the  outer  normal  and  s arc  length  on  F),  and 
corresponding  to  the  Kuler  equation  of  (19),  as  a single  subsidiary  equation 
we  impose 

(27)  d = py  + qy  . 

The  Euler  equations  for  (26)  are  then 


k (px  + qy’x  - p = 0 . 

(28) 

i(px  *qyly  - q = o . 

and  the  natural  boundary  condition  is 

(29)  7 (p  + q ) - u - 0 on  T. 

a * y 

These  equations  say  that  the  function 
1 

u = a <PX  + qy> 


satisfies  the  following  conditions: 
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uN  ' p,  Uq  - q,  Au  - au  - 0,  u - u on  T. 


Again,  the  minimum  value  c > f (26)  is  equal  to  the  negative  of  the  minimum 
value  of  (24). 


2.  The  Dirichlet  problem 


(30) 


Iff  <U2  + 

2 > ) c.  x y 


)dxdy  = Min.  , u = u on  T 


corresponds  to  the  degenerate  case  II.  3.  Through  the  introduction  of 

1 9 2 

ux  = p,  uy  = q,  <Kp,q)  = 2"(p“  + q ) one  attains  the  variational  problem 
(3D  |jJ(.(p2  + q2)  dxdy  - /r(pxn  + q.yn)  u ds  = Min., 

under  the  subsidiary  equation  px  + qy  r 0.  We  satisfy  this  subsidiary 
equation  by  setting  p = vy  , q = -vx  , and  upon  integrating  by  parts  along 
T obtain  the  problem 


(32)  | j/G(vx  + vy  )dxdy  + /pVUgds  = Min., 

whose  solution  v is  the  potential  function  conjugate  to  u.  The  equation 
Av  = 0 and  the  boundary  condition  vn  = -us  are  satisfied. 

Without  knowing  the  existence  of  a solution  of  (30),  we  can  see 
that  the  sum  of  the  greatest  lower  bounds  of  (30)  and  (32)  is  non-negative. 
This  results  from  the  identity 

JJG<u2  + u2  + Vx  + Vy  )dxdy  4 2 Jpvusds  = ^//(,[(ux-vy)2  + (uy+vx) 


In  the  "free"  problem  (31)  we  can  require  that  the  functions  p and  q 
satisfy  the  associated  Euler  equation  py  - qx  = 0.  We  can  then  introduce 
function  u through  the  equations  ux  = p,  uy  = q,  and  obtain  the  problem 


]dxdy . 
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(82)  4"  J ) ( ■ ( u x 4 uy  * clxdy  “ JpUnu  ds  = Min. 

for  ;ill  functions  satisfying  the  subsidiary  equation  Au  0.  This  is  non* 
other  than  the  method  considered  by  Trefftz.  For  numerical  purposes  it  is 
perhaps  practical  to  retain  this  supplementary  procedure:  however,  whenever 
it  is  possible  to  obtain  without  difficulty  approximation  functions  which  as  a 
matter  of  course  satisfy  the  natural  boundary  conditions  and  not  the  Euler 
equation,  one  would  be  inclined  to  expect  a better  convergence  when  it  is 
replaced  by  the  Ritz  method. 
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1)  | 1 >,  p.  13].  In  general  the  proof  of  the  convergence  of  a minimizing 

sequence  provides  some  estimate  of  the  approximation  to  the  solution 
itself. 


2) 


! 

I 


3) 


» 

' 


4) 


: 

| 


5) 

6) 


[2>,  p.  13].  Kin  Gcgensth'ck  /urn  Rit/schen  Verfahren  Verhandlungen 
des  zweiten  Internationalen  Kongresses  fur  Teehnische  Mechanik, 

7flrich  1927,  P.  131,  where  also  the  numerical  usefulness  is  examined 
in  important  practical  cases.  Pee  in  addition  Konvergenz  und 
Fehlerabsch'dtzung  beim  Hitz.schen  Verfahren,  Math.  Annalen  101(1923), 
where  also  the  pointwise  approximation  to  the  solution  is  deduced  from 
the  approximation  to  the  minimum  value. 

(3),  p.  13].  Indeed,  this  was  the  starting  point  of  the  following  consider- 
ation. See  Riemann,  Weber  and  Frank,  Die  Differential-  u. 

Integra lgleichungen,  II,  V,  §4,2.3. 

[Corrected  version  of  a badly  worded  Footnote  1),  p.  14].  The  integrand 
function  F of  (1)  is  supposed  to  have  continuous  derivatives  of  the  first 
two  orders  with  respect  to  its  arguments  on  a closed  region  R.  The 
functions  u(x)  and  u'(x),  xQ  * x £ xj  , are  supposed  to  be  bounded,  with 
(u'(x),  u(x),  x)  in  B.  Also,  conditions  (4)  and  (5)  hold  in  B. 

(1),  p.  15].  Instead  of  making  the  negative  of  (10)  a maximum. 

[Modified  phrasing  of  Footnote  2),  p.  15].  (p'(x),  p(x),  x),  x * x * x^ 

is  supposed  to  lie  in  a region  B of  the  same  nature  as  B,  and  which 
is  contained  in  the  transform  of  B determined  by  (7);  p'  is  supposed  to 
be  continuously  differentiable. 


7)  (1),  p.  16].  This  corresponds  to  the  Legendre  transformation  of  the 

second  variation. 


8) 

9) 

10) 


[2),  p.  161. 

fD.  P-  19]. 
(2),  p.  19]. 


We  write  F(u),  $ (p)  in  place  of  F(u',  u,  x),  $(p',  p,  x),  etc. 
Perhaps  with  piecewise  continuously  differentiable  tangent. 
It  possesses  continuous  first  and  second  derivatives  on  the 
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